February 4, 2000 (3:52PM) ID 700690 



V. CLAIMS 



1 1. A data processing device comprising: 

2 - at least one input for receiving data including 

3 - viewer profile data; and 

4 - data regarding a television program; and 

5 - a processor, the processor being adapted to perform the 
P6 following 

#7 - calculating a probability that the television program 

^=^^8 is a desired one; and 

[it : 

^'^9 - supplying a recommendation regarding the television . 

%0 program based on the probability. 

ii ? 

;f= 

El 2. The data processing device of claim 1 wherein the input is 
'''^2 coupled with a medium readable by the data processing device. 

1 3. The data processing device of claim 2, wherein the medium 

2 embodies the viewer profile. 

1 4. The data processing device of claim 3, wherein 

2 > the medium is local to the data processing device and 

3 > the viewer profile is arranged so as to be incrementally 
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4 updatable. 

1 5. The data processing device of claim 3, wherein the processor 

2 maintains the viewer profile in accordance with a data 

3 structure comprising: 

4 > a list of feature values; and 

5 > for each element of the list, a respective number of times 
rj6 programs having that feature value were watched. 

rifL 6. The data processing device of claim 5, wherein the data 

ry 

k2 structure further comprises, for each element of the list, a 

J, 3 respective number of times programs having that feature value 

m 

|=|j4 were not watched. 

i^jl 7, The data processing device of claim 6, wherein the processor 

2 is further arranged to perform the following, each time a 

3 user watches a new program, 

4 > first adding, to the list, feature values or counts of such 

5 feature values, associated with that new program; 

6 > selecting at least one companion program to the new 

7 program, the companion program being selected at random from 

8 a program schedule, which companion program has not been 

9 watched; and 
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10 > second adding, to the list, feature values of the companion 

11 program, or counts of such feature values. 

1 8. The data processing device of claim 5, wherein the processor 

2 is further arranged to perform the following, each time a 

3 user watches a new program: first adding, to the list, 

4 feature values or counts of such feature values, associated 

5 with that new program; 



The data processing device of claim 2, wherein the medium 
embodies the data regarding the television program. 

m 10. The data processing device of claim 1, wherein the input is a 

'if= network connection. 

("1 

11. The data processing device of claim 1, wherein calculating 
comprises using a Bayesian classifier. 

12. The data processing device of claim 11, wherein the 
processor is further adapted to subject the viewer profile to 
a noise threshold calculation prior to using the Bayesian 
classifier . 
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1 13. The data processing device of claim 12, wherein 

2 > the viewer profile data comprises 

3 > a list of feature values; 

4 > a respective negative count for each element of the list, 

5 the negative count indicating a number of times programs 

6 having that feature value have not been watched; 

7 > a respective positive count for each element of the list, 
IliB the positive count indicating a number of times programs 
1^ having that feature value have been watched; 

8b > the noise threshold calculation comprises 

ih. > selecting a sub-list comprising at least feature values 

%2 having at least one specific type of feature; 

I;' J 
-I ! 

:1J3 > choosing the highest negative count in the sub-list as the 

:!i;4 noise threshold; 

1"5 the recommendation comprises a program selected from a group 

16 having at least one feature value having a positive or negative 

17 count in the viewer profile, which count exceeds the noise 

18 threshold. 



1 14, The data processing device of claim 12, wherein subjecting 

2 the viewer profile to the noise threshold further comprises using 

3 observations gathered by a known random process to estimate a 

4 reasonable noise threshold. 
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15, The data processing device of claim 13/ wherein the specific 
type comprises a day and time of day feature type. 

16, The data processing device of claim 13, wherein the specific 
type comprises a station identification feature type. 

17, The data processing device of claim 1, wherein the viewer 
profile data comprises a plurality of respective counts of 
programs watched, each respective count indicating how many 
programs watched had a respective feature. 
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1 18. The data processing device of claim 11, wherein calculating 

2 comprises calculating a probability that the television 

3 program is in a particular class. 
4 

1 19. The data processing device of claim 18, wherein the class is 

2 one of 

3 > programs the viewer is interested in, and 
r.,A > programs the viewer is not interested in. 

hz~ 

I in 

;f= 

il 20. The data processing device of claim 1, wherein calculating 
l^Z the probability comprises: 

j; 3 - computing a prior possibility, of whether a program is 
nj4 desired or not; 

=f5 - computing a conditional probability of whether a feature fi 
p6 will be present if a show is desired or not; and 

7 - computing a posterior probability of whether program is 

8 desired or not, based on the conditional probability and the 

9 prior probability. 

1 21. The data processing device of claim 1, wherein it is assumed 

2 that programs watched are programs that the viewer is 

3 interested in. 
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1 22. The data processing device of claim 1, wherein the processor 

2 is further adapted to provide a recommendation regarding an 

3 additional item, other than a television program, based on the 

4 viewer profile. 

1 23. The data processing device of claim 1, wherein the processor 

2 is further adapted to occasionally recommend a surprise show 



'^==^3 that has relatively few features in common with watched shows. 

ik 

■its 

1 24. The data processing device of claim 1, wherein 

^'■"^ 2 > the viewer profile comprises a list of features types and 

;;;=[ 3 values for such feature types; 

4 > the feature types are selected from at least two sets, 

% 5 including 

6 > a first set of feature types whose values are deemed non- 

7 independent; and 

8 > a second set of feature types whose values are deemed 

9 independent; and 

10 > calculating a probability comprises 

11 > applying a Bayesian classifier calculation corresponding to 

12 feature types from the second set; and 

13 > applying a modified Bayesian classifier calculation 
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corresponding to feature types from the first set. 

25. The data processing device of claim 24, wherein 

> with respect to features of the first set, the modified 

Bayesian classifier calculation considers only feature values 
that match with a show being classified. 

26. At least one medium readable by a data processing device and 
embodying software arranged to perform the following 
operations: 

calculating a probability that a television program is 
a desired one, based on a viewer profile and data 
regarding the television program; and 
supplying a recommendation regarding the television 
program based on the probability. 

27. The at least one medium of claim 26, wherein the at least 
one medium further embodies the viewer profile. 

28. The at least one medium of claim 27, wherein the viewer 
profile is arranged so as to be incrementally updatable. 

29. The at least one medium of claim 27, wherein the viewer 
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2 profile is embodied as a data structure comprising: 

3 > a list of feature values; and 

4 > for each element of the list, a respective number of times 

5 programs having that feature value were watched. 

1 30. The at least one medium of claim 29, wherein the data 

2 structure further comprises, for each element of the list, a 
p3 respective number of times programs having that feature value 

m 

^ were not watched. 



110 

■^Ji 31. The at least one medium of claim 30, wherein the software is 

[2 further arranged to perform the following, each time a user 

ift watches a new program, 

A > first adding, to the list, feature values or counts of such 

A 

ff> feature values, associated with that new program; 

6 > selecting at least one companion program to the new 

7 program, the companion program being selected at random from 

8 a program schedule, which companion program has not been 

9 watched; and 

10 > second adding, to the list, feature values of the companion 

11 program, or counts of such feature values. 



1 



32. The at least one medium of claim 29, wherein the software 
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2 is further arranged to perform the following, each time a user 

3 watches a new program: first adding, to the list, feature 

4 values or counts of such feature values, associated with that 

5 new program. 



33 The at least one medium of claim 26, wherein the at least 
one medium embodies the data regarding the television 
program. 



comprises using a Bayesian classifier. 



34 The at least one medium of claim 2 6, wherein calculating 
i^l 35 The at least one medium of claim 34, wherein the software is 



further adapted to subject the viewer profile to a noise 
threshold calculation prior to using the Bayesian 
classifier . 



1 36 The at least one medium of claim 35, wherein 

2 > the viewer profile data comprises 

3 > a list of feature values; 

4 > a respective negative count for each element of the list, 

5 the negative count indicating a number of times programs 

6 having that feature value have not been watched; 
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7 > a respective positive count for each element of the list, 

8 the positive count indicating a number of times programs 

9 having that feature value have been watched; 

10 > the noise threshold calculation comprises 

11 > selecting a sub-list comprising at least feature values 

12 having at least one specific type of feature; 

13 > choosing the highest negative count in the sub-list as the 
fi4 noise threshold; 

m 

,p5 > the recommendation comprises a program selected from a group 

m 

i.il6 having at least one feature value having a positive or negative 

ru 

■^7 count in the viewer profile exceeding the noise threshold. 

The data processing device of claim 35, wherein subjecting 
the viewer profile to the noise threshold further comprises 
using observations gathered by a known random process to 
estimate a reasonable noise threshold. 



37 



38, The at least one medium of claim 36, wherein the specific 
type comprises a day and time of day feature type. 

39. The at least one medium of claim 36, wherein the specific 
type comprises a station identification feature type. 
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40. The at least one medium of claim 2 6, wherein the viewer 
profile data comprises a plurality of respective counts of 
programs watched, each respective count indicating how many 
programs watched had a respective feature. 

41. The at least one medium of claim 40, wherein calculating 
comprises calculating a probability that the television 
program is in a particular class. 

42. The at least one medium of claim 40, wherein the class 
comprises at least one of programs the viewer is interested 
in and programs the viewer is not interested in. 

43. The at least one medium of claim 26, wherein calculating the 
probability comprises: 

computing a prior possibility, of whether a program is 
desired or not; 

computing a conditional probability of whether a feature fi 
will be present if a show is desired; and 
computing a posterior probability of whether program is 
desired or not, based on the conditional probability and the 
prior probability. 
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1 44, The at least one medium of claim 26, wherein it is assumed 

2 that programs watched are programs that the viewer is 

3 interested in. 

1 45. The medium of claim 2 6, wherein the software is further 

2 arranged to provide a recommendation regarding an additional 
item, other than a television program, based on the viewer 

'.Its 

% profile. 

m 

rli 

46. The at least one medium of claim 26, wherein the software is 
'2 further arranged to occasionally recommend a surprise show 

1:3$ that has relatively few features in common with watched show. 

i !=■ 

47. The at least one medium of claim 2 6, wherein 

2 > the viewer profile comprises a list of features types and 

3 values for such feature types; 

4 > the feature types are selected from at least two sets, 

5 including 

6 > a first set of feature types whose values are deemed non- 

7 independent; and 

8 > a second set of feature types whose values are deemed 

9 independent; and 
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10 > calculating a probability comprises 

11 > applying a Bayesian classifier calculation corresponding to 

12 feature types from the second set; and 

13 > applying a modified Bayesian classifier calculation 

14 corresponding to feature types from the first set. 

1 48. The at least one medium of claim 47, wherein with respect to 

i>»^2 features of the first set, the modified Bayesian classifier 

t=3 calculation considers only values that match with a show being 

i=j^4 classified. 

nl 



1 49. A computer method comprising performing the following 

i^j2 operations in a data processing device: 

^3 - Receiving a set of data; 

u4 - Filtering the data in accordance with a noise criterion; 

5 - Drawing a conclusion from the filtered data based on a 

6 Bayesian classifier calculation; and 

7 - Presenting the conclusion to a user. 



1 ■'50. The method of claim 49 wherein the noise criterion is based 

2 on a frequency of instances of a particular types of data 

3 within set, which types are believed to likely represent 

4 noise. 
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51. A data processing method comprising performing the following 
operations in a data processing device: 

first receiving data reflecting physical observations, which 
data includes a list of feature values and observations 
about feature values, some of which feature values are 
independent and some of which are not; 

second receiving data about an item to be classified, the 
data about the item to be classified including feature 
values; 

maintaining a division of the data reflecting physical 
observations into at least two sets, including 

a first set including those feature values which are 

deemed not independent; and 

a second set including those feature values which are 
deemed independent; 
performing a probabilistic calculation on the data 
reflecting physical observations and the data regarding an 
item to be classified including: 

applying a Bayesian classifier calculation with respect 
to feature values relating to the second set; and 
applying a modified Bayesian classifier calculation 
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with respect to feature values relating to the first 
set 

presenting a conclusion regarding the item to be classified 
to a user based on the probabilistic calculation. 

52. The method of claim 51, wherein the modified Bayesian 
classifier calculation comprises ignoring feature values from the 
data reflecting physical observations when those feature values 
are not present in the data regarding the item to be classified . 
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